SemanticScuttle - klotz.me » Tags: hugging face+machine learning

This page details the command-line utility for the Embedding Atlas, a tool for exploring large text datasets with metadata. It covers installation, data loading (local and Hugging Face), visualization of embeddings using SentenceTransformers and UMAP, and usage instructions with available options.

2025-08-13 Tags: embedding, text, data, visualization, umap, sentence transformers, command line, hugging face, parquet, duckdb by klotz

Primer LLM Embedding

This Space demonstrates a simple method for embedding text with a large language model (LLM) via the Hugging Face Inference API. It shows how to convert text into numerical vector representations, which are useful for semantic search and similarity comparisons.
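As a rough illustration of the similarity comparison mentioned above, here is a minimal sketch in plain Python. It does not call the Hugging Face Inference API; the three vectors are hand-made, hypothetical values standing in for model output, and the helper names `cosine_similarity` and `most_similar` are made up for this example.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # convention chosen here: a zero vector is similar to nothing
    return dot / (norm_a * norm_b)


# Hypothetical toy embeddings; a real model would return hundreds of dimensions.
embeddings = {
    "cat": [0.9, 0.1, 0.0],
    "kitten": [0.8, 0.2, 0.1],
    "invoice": [0.0, 0.1, 0.9],
}


def most_similar(word, table):
    """Return the other key in `table` whose vector is closest to `word`."""
    candidates = [key for key in table if key != word]
    return max(candidates, key=lambda key: cosine_similarity(table[word], table[key]))


print(most_similar("cat", embeddings))
```

With real embeddings the comparison step would look the same; only the source of the vectors changes.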

2025-03-28 Tags: llm, embedding, hugging face, inference, api, semantic search, vector representation, text embedding by klotz

A Coding Implementation to Build a Document Search Agent (DocSearchAgent) with Hugging Face, ChromaDB, and Langchain

This tutorial demonstrates how to build a document search engine with semantic search capabilities using Hugging Face embeddings, ChromaDB, and LangChain.
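The store-then-query pattern behind such a search engine can be sketched without any of the libraries named above. The example below is an assumption-laden stand-in, not the tutorial's code: `embed` uses plain word counts instead of Hugging Face embeddings (so it only matches shared words, not meaning), and `TinyIndex` is a hypothetical in-memory substitute for a vector database such as ChromaDB.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy stand-in for an embedding model: lowercase word counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(count * b[word] for word, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class TinyIndex:
    """Hypothetical in-memory index: stores vectors, returns top-k matches."""

    def __init__(self):
        self._items = []  # list of (doc_id, vector)

    def add(self, doc_id, text):
        self._items.append((doc_id, embed(text)))

    def search(self, query, k=2):
        query_vec = embed(query)
        scored = [(cosine(query_vec, vec), doc_id) for doc_id, vec in self._items]
        scored.sort(key=lambda pair: pair[0], reverse=True)
        # Drop documents that share nothing with the query.
        return [doc_id for score, doc_id in scored[:k] if score > 0]


index = TinyIndex()
index.add("a", "Fine-tuning BERT for sentiment analysis")
index.add("b", "Building web apps with Gradio")
index.add("c", "Sentiment analysis of product reviews")

print(index.search("sentiment analysis"))
```

Swapping `embed` for a real embedding model and `TinyIndex` for a persistent vector store keeps the same two operations: add documents once, then rank them against each query.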

2025-03-21 Tags: document, search, hugging face, chromadb, langchain, vector database, embedding, agents, llm by klotz

Hugging Face Clones OpenAI’s Deep Research in 24 Hours

Hugging Face researchers developed an open-source AI research agent called 'Open Deep Research' in 24 hours, aiming to match OpenAI's Deep Research. The project demonstrates the potential of agent frameworks to enhance AI model capabilities, achieving 55.15% accuracy on the GAIA benchmark. The initiative highlights the rapid development and collaborative nature of open-source AI projects.

2025-02-06 Tags: hugging face, openai, deep research, agent, benchmark, machine learning, llm by klotz

Open-R1: a fully open reproduction of DeepSeek-R1

Hugging Face's initiative to replicate DeepSeek-R1 focuses on developing datasets and sharing training pipelines for reasoning models.

The article introduces Hugging Face's Open-R1 project, a community-driven initiative to reconstruct and expand upon DeepSeek-R1, a cutting-edge reasoning language model. DeepSeek-R1, which emerged as a significant breakthrough, utilizes pure reinforcement learning to enhance a base model's reasoning capabilities without human supervision. However, DeepSeek did not release the datasets, training code, or detailed hyperparameters used to create the model, leaving key aspects of its development opaque.

The Open-R1 project aims to address these gaps by systematically replicating and improving upon DeepSeek-R1's methodology. The initiative involves three main steps:

1. **Replicating the Reasoning Dataset**: Creating a reasoning dataset by distilling knowledge from DeepSeek-R1.
2. **Reconstructing the Reinforcement Learning Pipeline**: Developing a pure RL pipeline, including large-scale datasets for math, reasoning, and coding.
3. **Demonstrating Multi-Stage Training**: Showing how to transition from a base model to supervised fine-tuning (SFT) and then to RL, providing a comprehensive training framework.

2025-01-28 Tags: open-r1, deepseek-r1, hugging face, reinforcement learning, llm, open source by klotz

How to Build a Text Classification Model using Hugging Face Transformers

A detailed guide on creating a text classification model with Hugging Face's transformer models, including setup, training, and evaluation steps.

2024-12-17 Tags: text classification, hugging face, transformers, machine learning, nlp by klotz

HunyuanVideo: A Systematic Framework For Large Video Generation Model Training

HunyuanVideo is an open-source video generation model whose performance is comparable or superior to that of leading closed-source models. Its features include a unified image and video generative architecture, a large language model text encoder, and a causal 3D VAE for spatial-temporal compression.

2024-12-05 Tags: hunyuanvideo, text-to-video, llm, hugging face, tencent, machine learning by klotz

Gradio 5 is here: Hugging Face’s newest tool simplifies building AI-powered web apps

Hugging Face launches Gradio 5, a major update to its popular open-source tool for creating machine learning applications, aimed at making AI development more accessible and secure for enterprises.

2024-10-10 Tags: gradio, hugging face, machine learning, webdev, ui by klotz

How to Fine-Tune BERT for Sentiment Analysis with Hugging Face Transformers

This tutorial covers fine-tuning BERT for sentiment analysis using Hugging Face Transformers. Learn to prepare data, set up the environment, train and evaluate the model, and make predictions.

2024-06-06 Tags: bert, sentiment analysis, hugging face, transformers, natural language processing, machine learning, pytorch, data science by klotz
